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Abstract 

We consider a multidimensional diffusion X with drift coefficient b(a,Xt) and diffusion 
coefficient ea(/3,Xt). The diffusion sample path is discretely observed at times = kA for 
k = l..n on a fixed interval [0, T]. We study minimum contrast estimators derived from the 
Gaussian process approximating X for small e. We obtain consistent and asymptotically 
normal estimators of a for fixed A and e — > and of (a, (3) for A — > and e — > without 
any condition linking e and A. We compare the estimators obtained with various methods 
and for various magnitudes of A and e based on simulation studies. Finally, we investigate 
the interest of using such methods in an epidemiological framework. 

Keywords: Minimum contrast estimators, low frequency data, high frequency data, 
epidemic data. 



1. Introduction 

In this study we focus on the parametric inference in the drift coefficient b(a,X^) and 
in the diffusion coefficient ea(^,Xf) of a multidimensional diffusion model (X^) t>0 with 
small diffusion coefficient, when it is observed at discrete times on a fixed time interval in 
the asymptotics e — > 0. This asympto tics has been widely studied and has proved fruitful 



in applied problems, see e.g. [FREI84I ]. Our interest in considering this kind of diffusions 
is motivated by the fact that they are natural approximations of epidemic processes. In- 
deed, the classical stochastic SIR model in a closed population, describing variations over 
time in Susceptible (S), Infectious (/) and Removed (R) infividuals, is a bi-dimensional 
continuous-time Markovian jump process. The population size (iV) based normalization of 
this process asymptotically leads to an ODE system. Before passing to the limit, the forward 
Kolmogorov diffusion equation allows describing the epidemic dynamics through a bidimen- 
sional diffusion, with diffusion coefficient proportional to 1/s/N. Moreover, epidemics are 
discretely observed and therefore we are interested in the statistical setting defined by dis- 
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crete data sampled at times tk = kA on a fixed interval [0, T] with T = nA. The number 
of data points is n and A, the sampling interval, is not necessarily small. 



Historically, statistics for diffusions were devel oped for co ntinuo usly observed processes 



leading to explicit formulations of the likelihood ([KUT8J], LIPOll ]). In this context, two 
asymptotics exist for estimating a for a diffusion continuously observed on a time interval 
[0, T): T — > oo for recurrent diffusions and T fixed and the diffusion coefficient tends to 0. 
In practice, however, observations are not continuous but partial, with various mechanisms 
underlying the missingness, which leads to intractable likelihoods. One classical case con- 
sists in sample paths discretely observed with a sampling interval A. This adds another 
asymptotic framework A — > a nd raise s the question of estimating paramet er s in the dif- 
fusion coefficient (see |GEN93| . lYOSg^ for T fixed and |HANS98j . |kES0(| . [UCH12( | for 
T — »■ oo). 



Since nineties, statistical methods associate d to disc r ete data h ave been developed in the 
asymptotics of small diffusion coefficient (e.g. LAR90| . GEN90 ], UCH04 ]). Considering a 
discretely observed diffusion on M with constant (= e) diffu sion coe fficient, Genon-Catalot 
(1990) obtained, using the Gaussian approximating process |AZE82j [. a consistent and e - 
normal and efficient estimator of a under the condition {e — > 0, A — > 0, e/\/A = 0(1)}. 
The author ad ditionall y proved that this estimator possessed good properties also for A 
fixed. Uchida UCH041 ] obt ained si milar results using approximate martingale estimating 
equations. Then, S0rensen SOROOl ] obtained, as e — > 0, consistent and e -normal estima- 
tors of a parameter 9 present in both the drift and diffusion coefficient, with no assumption 
on A, but under additional conditions not verified in the case of distinct paramet ers in the 
drift and diffusion coefficient. For this latter case, S0rensen and Uchida SOR03| obtained 
consistent and e -1 -normal estimators of a and consistent and %/n- normal estimators of ft 
under the con dition A /e — > and vA/e bounded. This result was later extended by Gloter 
and S0rensen GLO09I ] to the case where e _1 A p is bounded for some p > 0. Their results 
rely on a class of contrast processes based on the expansion of the infinitesimal generator of 
the diffusion, the order of the expansion being driven by the respective magnitude of e and 
A and requiring this knowledge (value of p), which might be a drawback when applying the 
method. Moreover, this contrast becomes difficult to handle for values of A that are not 
very small with respect to e. 

To overcome this drawback, we co nsider a si mple con trast based on the Gaussian approx- 
imation of the diffusion process X e ((AZEsij, |FREI84j l)- Contrary to Gloter and S0rensen 



GLO09I ]. our contrast has generic formulation, regardless to the ratio between A and e. 



Thus, the standard balance condit ion betw een e and A of previous works is here removed. 
Our study extends the results of |GEN90l ] to the case of multidimensional diffusion pro- 
cesses with parameters in both the drift and diffusion coefficient. We consider successively 
the cases A fixed and A — > 0. We obtain consistent and e _1 -normal estimators of a (when 
ft is unknown or equal to a known function of a) for fixed A. For high frequency data, 
we obtain results similar to GLO09I ]. but without any assumption on e with respect to A. 
The estimators obtained are analytically calculated on a simple example, the Cox-Ingersoll- 
Ross (CIR) model. Finally, they are compared based on simulation st udies in the case of 
a financial two-factor model LON95 ] and of the epidemic SIR model DIE0d |. for various 
magnitudes of A and e. 

The paper is structured as follows. After an introduction, Section 2 contains the nota- 
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tions and preliminary results on the stochastic Taylor expansion of the diffusion. Sections 
3 and 4, which constitute the core of the paper, present analytical results, both in terms of 
contrast functions and estimators properties. We investigate in Section 3 the inference when 
A fixed and e — > in three contexts depending on whether the parameter (3 in the diffusion 
coefficient is unknown, equal to a known function of a (with the special case (3 = a) or 
whether the diffusion coefficient is multiplicative. Section 4 is devoted to the case A — > 0. 
Results are applied in Section 5 to the CIR model. Moreover, the different estimators ob- 
tained are compared based on numerical simulations to the minimum contrast estimator of 



Gloter and S0rensen GLO09I ]. mainly in the context of epidemic data. 



2. Notations and preliminary results 

Let us consider on a probability space (f2, A, (At)t>o, P) the p-dimensional diffusion 
process satisfying the stochastic differential equation 

dX\ = b(a, Xt)dt + ev(J3, XfidBt 

Xq = Xq, 

where xq S MP is prescribed, e > 0, = (a, (3) are unknown multi-dimensional parameters, 
b(a, x) is a vector in MP, o~{f3, x) is a p x p matrix and (Bt)t>o is a p-dimensional Brownian 
motion defined on (0,^4). 

Throughout the paper we use the convention that objects are indexed by 9 when there 
is a dependence on both a and (3 and by a or (3 alone otherwise. Let us denote by M P (M) 
the set of p x p matrices, and by t M, Tr(M) and det(M) respectively the transpose, trace 
and determinant of a matrix M. 

We denote the partial derivatives of a function f(a,x) in (ao,Xo) by f^(ao,£o) an d 

^(a>o,%o)- Moreover, if x = x(a,t) the derivative of the function 

a — > f(a,x(a,t)) in ao will be denoted by 

e/(«gC«,«)) (ao ) = f£( ao , x ( ao ,i)) + g(ao,a;(ao,t))g(ao,*). 

We set 

E(P,x) = a(l3,x) t a(/3,x). (2.2) 

In what follows, we assume that A = sup(At, t > 0), (At)t>o 1S right-continuous and 

{(i) 3U, open set of M p such that, for small enough e, Vt £ [0, T], X\ £ U 
(ii) b(a,-) G C 2 ([/,RP),a(/3,-) gC 2 (C/,A^ p ) 
(m) 3K > 0, ||6(a, x) - b(a,y)\\ 2 + ||ct(/5,x) - a(/3,y)|| 2 < K\\x - y\\ 2 
(H2) Vx G 17, E(/3,a;) is invertible 

Assumptions (HI) and (H2) ensure existenc e and uniqueness of a strong solution of (|2.ip . 
with infinite explosion time (see e.g. IKE89I ]). 



2.1. Results on the ordinary differential equation 

Consider the solution x a (t) of the ODE associated with e = in ()2.1 

dx a (t) = b(a, x a (t))dt 



x a (0) = x £ MP. ^ 
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Under (HI), this solution is well denned, unique and belongs to C 2 (U, MP). Let us consider 
the matrix $ a (-,£o) G Mp, solution of 



d&g U 4- \ — db 
$a(*0,*o) = Ij 



(t,t ) = §(a,x a (t))$ a (t,to) 



(2.4) 



p- 



Under (HI), it is well known (see e.g. [CAR71I ]) that, for t G [0,T], $<*(•, *o) is twice 



continuously differentiable on [0, T] and satisfies the semi-group property 

V(t , h,t 2 ) G [0, T] 3 , $ tt (t 2 , to) = $ a (t 2 , ti)® a (ti, t ). (2.5) 

A consequence of (|2.5p is that the matrix $ a (ii,io) is invertible with inverse 3> Q (to>ti)- 



2.2. Taylor Stochastic expansion of the diffusion (Xf) 

We use in the sequ el some known results for small perturbations of dynamical systems 
(see jFREI84l . lA~ZE82l p. The family of diffusion processes (Xf, t G [0,T]) solution of (gZQ 



satisfies the following theorem. 
Theorem 2.1. Under (HI), 

X\ = x a (t) + eg e (t) + e 2 R 2 e ' e (t) with sup {\\eR 2 g ' e (t)\\} — > in probability (2.6) 

te[o,T] 

and wi/i x a (-) defined in \2. 3\) and g$(t) satisfying 

db 

dge(t) = ^(a,x a (t))g e (t)dt + a{P,x a (t))dB t ,with g e {0) = 0. (2.7) 
Remark 2.1. VFe use also in sequel the Taylor expansion of order 1 

X\ = x a (t) + eR l e ,e (t) with sup ||ei?g' e (t)|| — > in probability. (2-8) 

te[o,T] 

Corollary 2.1. Under (HI), i/ie process ge(-) is the continuous Gaussian martingale on 
[0, T] defined, using {2.J$ , by 

9o(t)= [ $ a {t,s)a(P,x a {s))dB s . (2.9) 
Jo 

Proof. Using (|2.5|) . the matrix ^(t, 0) is invertible with inverse <l> a (0,i). The process C(t) 
defined by g e (t) = $ a (t,0)C(t) satisfies, using fllL7|), dC(t) = $ a (Q,t)<r(/3,x a (t))dB t and 
C(0) = 0. Thus, applying ([23]) yields (jZU). □ 

Corollary 2.2. Assume (HI). //, moreover, b(a,.) anda(f3,.) have uniformly bounded 
derivatives on U , then there exist constants only depending on T and 9 such that 
(i)Vte[0,T], E[(]||^' £ (t)|| 2 )<C 1 , 



(%%) Vi G [0,T], as h -»• 0, E ||^' e (t + h)-R t 
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The result O-(i) is given in (FRETS' (Theorem 2.2 p. 56), proof ofH^(ii) is g 
Appendix 17.21 



lvcn m 



An important consequence of Corollary 12.11 is the following lemma on the Gaussian process 
go. Let us define 

Z{ = ^=[ k <S> a (t k ,s)a((3,x a (s))dB s , (2.10) 
V A 

S k '" = ^ jf * $ a (t k , x a (s)) % a {t k , s)ds. (2.11) 

Lemma 2.1. Under (HI), the random variables ge(t k ) ver tfy, f or ifc = kA, k = 1, . . . ,n, 

ge{t k ) = ^ a (tk,t k -i)ge{tk-i) + ^AZ e k , (2.12) 

where (Zl)i<h<n defined in 112. 1 0\) is a sequence of MP- dimensional independent centered 
Gaussian random variables, At k - measurable and with the covariance matrix Sp^ 

Proof. Using (|2.9p and the semi-group property of $ a (t, s) yields 

9e(tk) = $a(tk,tk-i)go{tk-i) + ft*_ 1 $a(tk,s)o-(/3,x a (s)) dB s . The proof is achieved by 
identifying in this relation. □ 

Note that (HI) and (H2) ensure that S^'^ is positive definite matrix. 



2. 3. Statistical framework 

Let C = C([0,T],R p ) denote the space of continuous functions defined on [0, T] with 
values in M. p endowed with the uniform convergence topology, C the a-algebra of the Borel 
sets, (Xt) the canonical coordinates of (C,C) and T± = o~(X s ,0 < s < T). Finally, let 
P e e = P e a a be the distribution on (C,C) of the diffusion process solution of (12. lj) . 
From now on, let 6q = («o>A)) G be the true value of the parameter. We assume 

(51) (a, p) G K a x K b = with K a , K b compacts sets of R a , R b ; 6 G 

(52) (H1)-(H2) hold for all (a, (3) G with constant K not depending on 

(53) The function b(a,x) is C 3 (K a x U,W) and <r(p,x) G C 2 (K b x U,M P ) 

(54) A -> 0: a ± a' =>- b(a,x a (-)) / b{a',x a ,{-)) 

(S4') A fixed: a / a' => {3k, 1 < k <n, x a (t k ) ^ x a >(t k )} 

(55) /3 ^ p S(/3,x Q0 (-)) / £(/3',x Q0 (-)). 

Assumptions (S1)-(S3) are classical for the inference for diffusion processes. The differ- 
entiability in (S3) comes from the regularity conditions required on a — > § a (t,s). Indeed, 
(S3) on b{a, x) ensures that $ a (i,t ) belongs to C 2 {K a x [0,T] 2 , M p ) (see Appendix O for 
the proof). (S4) is the usual identifiability assumption for a continuously observed diffusion 
on [0,T]. Note that (S4) ensures that, for A small enough, (S4') holds. 

For a sample path y(.) G C([0, T], MP), let us define the quantity depending on x a (.), 
$«(., .) and on the discrete sampling (y tk , k = 1, . . . , n), 

N k (y,a) = y{t k ) - x a (t k ) - ® a (t k ,t k -i)(y(t k -i) - x a (t k -i)). (2.13) 
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Note that Nk(x a ,a) = 0. Let us also define the Gaussian process (Xt)t&[o,T\ £ C, 

Y t € = x a {t) + eg e (t). 
Using (|2.12j) and (|2.13p . we can express the random variables Z e k using Y t e , 

= y A -j^> = Nk{ X^l, a) . (2 , 4) 

eyA eyA eyA 



Then, the n-sample (Y tk ,k = l,...,n) has an explicit loglikelihood l(a, (3; (Y tk )) which 
is, using (j!TT3j) and QZTQ, 

1 n 1 n 

k=l k=l 



3. Parametric inference for fixed sampling interval 



For the diffusion parameter /3, all existing results for discretized observations on a fixed 
sampling interval are provided in the context of the asymptotics A — )■ (T = nA). In 
this section we focus on a different asymptotics (e — > 0) as A is assumed to be fixed. We 
build a contrast process based on the the functions Nk(X,a) defined in (2.13). Except 
for some specific cases (e.g. linear drift in the diffusion process), the two deterministic 
quantities x a (.),<& a (., .) appearing in the N^'s are not explicit and are approximated by 
solving numerically an ODE with dimension p x (p + 1). 



3.1. One- dimensional Ornstein- Uhlenbeck process 

The one dimensional Ornstein-Uhlenbeck process is an appropriate illustration of the 
limitations imposed by the assumption A fixed. Indeed, assuming that a is known and 
equal to ao, the diffusion process (^t)tg[o,T] following dX t = aoX t dt + e[3dB t ,Xo = xq £ E 
is equal to its Gaussian approximation (X t = x ao (t) + eg aQ) p(t)), and l(ao,/3) is then the 

log-likelihood of Gaussian observations. Noting that = [3 2 - 2ao A > Xa o(t) = x o eaot 

and < l ) Q0 (tfc) tk-i) = e a ° A , the maximum likelihood estimator of /3 is given by 



PlA= e2 J a 2_ 1) ±(Xt k -e a ° A X tk J 



k= 



2 



Under P, , /3 e 2 A = #£V fc 2 , where U% = ( P e a °^dB s ) . 

k=i V^- 1 / 

Hence, (Uk)i<k<n are i.i.d. random variables A/"(0, 1), and /3 2 A is unbiased for all e but has 
no other properties as e — > 0. 
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3.2. General case (ft unknown) 



In the case where we have no information on ft, it is quite natural to consider a contrast 
process derived from the conditional least squares for (Y tk ), which does not depend on ft 
and is defined using (I2.4p and (12.131) by 



n 



U eA (a;(X tk )) = U eA (a) = ^W fc (X,a)JV fc (X,a). (3.1) 

k=l 

Then, the conditional least square estimator is defined as any solution of, 

«e,A = ar grain £/ £) a {Xt k )) ■ (3-2) 

aeK a 

Let us also define, 

1 n 

K A (a ,a) = — ^*iV fc (x ao ,a)iV fc (x ao ,a). 
fc=i 

Clearly, K&(ao,a) > and Ka(o:o>«o) = 0. Now, KA(otQ,a) = if for all k, 
x a (t k ) - x ao (t k ) = 4> Q (tfc,tfe_i)(x Q (t fe _i) - x ao (tk-i)). The matrix $ a (t k ,t k -i) being 
invertible, this is the idenfiability assumption (S4'). 

Lemma 3.1. Assume (SI), (S2). Then, under ¥g , 

U e &{a) — > K^(ao,a) in probability. (3-3) 

Using (|2.13p and the stochastic Taylor formula (|2.6p the proof is immediate. 
In order to study a e a, we define for 1 < i < a and for 1 < k < n, 



D kj i(a) = — 



— (a) + <P a (t k , t k _i) (a) 

OOLi OCii 



(3.4) 



and 



M A (a) = I A^'L> M (a)D fcj (a) ) e M a (R). (3.5) 

Proposition 3.1. Assume (S1)-(S3) and (S4'). Then, under ¥g , 

(i) a e a — ► «o * n probability. 

' e— >0 

(ii) If M^(fto) «s invertible, e _1 (a e A — «o) — >■ A/TO, J7 1 (ao ) A))) distribution, with 

JA(ao,fto) = M A ( a o)(A^ t D kil ( ao )S d k D kj3 (ao)] *M A (a ). (3.6) 

V fc=l / l<ij<a 

Proof. The proof of (i) is classical and relies on the control of the continuity modulus of 
^A,e(«) Xt k (u))) (see Appendix 17.31 for details). 

Let us just study ~ 9U Qa ^ • Expanding dU g^^ in Taylor series at point oq yields 



l dU t {a ) , 
e da 



d 2 U, 
da 2 



f 1 ( d 2 U e d 2 U e \ 

(q ) + J [-Q^r( a o + *( a e,A - a )) - ~g^( a o)j dt 



e 1 (a e - a ). 
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1 dUejag) 
e da 



First, we study the term 
Using ([22D and (l2ll)jl . 

1 



-N k {X, ao) — > ^° in P^-probability. (3.7) 

Now, by ([233]) and (33D, ^g^(ao) = AD^ao) + 9<M ^ fe - l} (ao) [X^ - x ao (t k ^)}, 
and based on (|2.8|) we obtain 

5 ("oj — > A/; fc i (ao)- (3.8) 

By Slutsky's Lemma, ^ ^%^^( a o)) (^a^O^ a o)) ^ D kti (a )Z e k a , and definition 
(|2.1ip yields that, under F 0Q , as e — >■ 0, 

7^ (ao) ^ ^ (°' 4A (^^(ao)^ '*^'^) J I m distribution. 

In Appendix 17.31 we prove, by using the matrix defined in (|3.5p . that Fq - a.s., 

d 2 U e 

- — ^—(a ) — > 2M A (a )i j and 
oaiOaj e->o 

sup ||^Sr(ao + ^(«e,A — «o)) — 7^( a o)|| — 51 0, which completes the proof of (ii). □ 

te[o,i] a 

It is well known that the Fisher Infor mation m atrix for a continuously observed diffusion 
in the asymptotics of e — > is (see e.g. |KUT84^ 



/ f Tt db db \ 

h(ao,/3 ) = i -K— (a ,x ao (s))S" 1 (/3 ,3; Qo (s))^— (a ,x ao (s))ds j . (3.9) 

f f Tt db db \ 

Setting Fb(ao,M) = / — — (ao, x ao (s))M(s) — — (ao, x ao (s))ds ) , we have that 

\Jo dai daj n<i,j<a 

M A (a ) ->■ F b (a ,I p ) and Ja(«o,/3o) -> ^("o, ^ P )(^fe(ao, S(/3 , x ao (-)))) -1 %(a , I P ) as 

A —7- 0. This is different from the Fisher Information matrix 

/b(ao,/3o) = Ffe(ao, S _1 (/3q, x Qo (•))), but possesses the right rate of convergence. 



3.3. Case of additionnal information on /3 

In this section we will consider successively the case where (5 is a known regular function 
of a and the multiplicative case for parameter (3 which applies to Ornstein-Uhlenbeck or 
Cox-Ingersoll-Ross models for examples. In the former context, one particular subcase, 
interesting in applications, is given by a = (see Section I5T31) ) . 
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3.3.1. Case of (3 = f(a), f known 

In many applicative situations, such as the modelling of epidemic spread (see Section [5^3|) . 
we have ft = a. Using a contrast depending on ft through a leads to the optimal asymptotic 
Information. We regroup these cases in a more general formulation with ft = f(a), where f 
is known and regular. 

Since the Gaussian process (Yj) is a good approximation of (Xt) for small e (Theorem 
12. ip . we use the likelihood (|2.15p to derive a contrast process for (X tk ). The sampling 
interval A being fixed, the first term of fl2 . 15f> converges to a finite limit as e — > 0. This 
leads to the contrast process, using (|2.1ip 

n 

U As€ (a;(X tk )) = iY, tN ^ X ^ a ^ S k fia) )' lN k(X,a). (3.10) 

k=l 

Then, we can define 

<5e,A = argmin U Si a {a, {X tk )) . (3-H) 

a€K a 

Clearly, under (51) — (S3), UA,e{a) — > Ka{olq,o), where 

n 

k A (a ,a) = -^Nki^aXS^^Nkix^a). (3.12) 



fc=i 

Assumption (S4') ensures that A'a(«0j q ) is non negative and has a strict minimum at 
a = a.Q. 

Proposition 3.2. Assume (S1)-(S3),(S4'). Then, 

(i) a €i A — > in Fg -probability. 

(ii) If Ia((*o, fto) is invertible, e -1 (a e A — «o) — > -A/"(0, /7 1 (ao, fto)), under¥g in distribu- 

e— »0 

tion, with 



lA(a ,fto) = A I J^*D M (a ) [S e k °) D kJ (a ) (3.13) 

\k=l J l<i,j<a 



The proof of (i) is a repetition of the proof of Proposition ()3.ip . The proof of (ii) 
relies again on the two properties. Under Fq , e -1 dU a A ' e (ao) — ► A/"(0,4Ia(o!o, fto)) in 

distribution and - ' qJ}^ («o) — * 2i A( a 0i fto) m probability. Contrary to Proposition 13.11 

additional terms appear due to the derivation of S^'^ a \ Those terms are controlled 
using a — > <& a (tk,t) and a — > T,(f(a),x a (t)) regularities. Details of the proof are given in 
Appendix 17.41 



Remark 3.1. Contrary to the previous contrast \3. the Covariance matrix is asymptotically 

optimal in the sense that /a (<*()> A)) — ► h(oto,fto) where 1^ is define in l\3.9\) . 

A— >0 
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3.3.2. The multiplicative case (E(ft,x) = f(ft)Tio(x)) 

The case of S(/3, x) = /(/3)So(x) with /(•) a stricly positive known function of C(M 6 , 
often occurs in practice. Noting that S% = f(ft)S^° with 

^k'° = ~k / ^a{tk, s)T l Q(x a (s)) % a (tk, s)ds. Define the contrast process 

n 

UA,e (a;(X tk )) = ^^(^aX^'V^pr^), then 
fc=i 

Corollary 3.1. Assume (51) - (S3) - (S4f), Then, under F 8a , as e -)• 0, 

(i) a e ,A — > olq in ^e -probability. 

(ii) If I&(olq, fto) is invertible, e _1 (a e A — «o) — > -A/"(0> -^A 1 ( a 0) A))) distribution. 

e-S>0 

Its study is similar to /3 = /(a), with a substitution of S?'^ a ' by S^' in (|3.10p . 



4. Parametric inference for small sampling interval 



We assume now that A — A n — y 0, so that the number of observations 11 — T/A n 
goes to infinity. The results obtained by Gloter and S0rensen state that, under 

the additional condition (Bp > 0, A^/e bounded), the rates of convergence for a, ft are 
respectively e _1 and , . Indeed, considering the one dimensional Ornstein-Uhlenbeck 

V A n 

process the estimator ffi A obtained in Section 3.1 is still the MLE, is consistent and satisfies 
n (Pi a " ft) ^ o AA(0,2/3 4 ). 



In the sequel, we follow SOR03| and GLQ09 ]. which allow to study contrast estima- 



tors of parameters which converge at different rates: we prove the consistency of a ej A in 
Proposition 14.11 and the tightness of the sequence (<5: e A — a o)/ e m Proposition 14.21 From 
this, we deduce the consistency of /3 £> a in Proposition 14.31 Asymptotic normality for both 
estimators is finally proved in Theorem 14.11 
For clarity, we omit the index n in A n . 

Using (I2.13p . let us consider now the contrast process (f t ,A ((ct,ft), (Xt k )) = U e ,A(ct, ft) 

n n 

U eA (a,ft) = ^log detS(/3,X 4fe _ 1 ) + -^-^^ fe (X,Q)S- 1 (/3,X tfc _ 1 )iV fc (X,a). (4.1) 

k=l ' k=l 

The minimum contrast estimators are defined as any solution of 

(a e ,A, fte,A) = argmin U £j A(a,ft). (4.2) 
(a,p)ee 

For studying these estimators, we need to state some lemmas on the behaviour of N^(X, a). 
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4-1- Asymptotic properties of N k (X,a) 

Clearly, as e goes to zero, Nk(X,a) converges to N k (x ao ,a) by (12, 8|) under Pq . Let us 
define the function 

db 

r(a ,a;t) = b(a ,x ao {t)) - b(a,x a (t)) - —(a,x a (t))(x ao (t) - x a (t)) G R p . (4.3) 

Then, functions (N k (x ao ,a)) k<n satisfy 
Lemma 4.1. Under (S2) 

,,N k (x ao ,a) 

su p II a r(«o,a;*fc-i)|| — ->■ o. 

fee{l,..,n},aeif a A 
Proof. First, note that N k (x ao ,a) defined in fj2. 13 j) writes 

N k (x ao ,a) = (x ao (t k ) - x ao (t k -i))-(x a (tk) - x a (t fc -i))-[^ Q (ifc,ife-i) - ip] (x Qo (t fc _i) - x a (t k -i)) 

(4.4) 

Hence, using fl4.3f) we have 
S^VfcC^ao) 01 ) = r(a 0) a;*fc-i) + s (x ao (tk) ~ x ao (tk-i)) - b(a ,x ao (t k -i)) 

~S (^a(ifc) - Za(*fc-l)) + X Q (4_i)) 

+ ($ Q (t fc ,tfc_i) - ip) - ^(a,x a (i fc -i))] (»a (*fc-l) - ^a(tfc-l)) • 
The uniform approximation is then obtained using the analytical properties (|7.1|) . (|7.4p of 
2) a and $ a given in Appendix 17. 11 □ 

Let us now study the properties of N k {X, a). 
Lemma 4.2. Assume (S1)-(S3). Then, under P$ , for all k (1 < k < nj, 



^ [-/V fc (X,a) - iV fc (X,a )] = — N k (x ao , a) + e||a - a ||%, 



where rj k = rj k (ao, a, e, A) is J r t k _ 1 -measurable and satisfies that, under ¥g , as e, A — >• 0, 

sitp ||?7fc|| is bounded in probability. 
ke{l,-,n},aeK a 



Proof. Using flSgi and (l2J3|) . N fc (X,a) writes 



N k (X, a) = N k (X, a ) + N k (x ao ,a) + ($ ao (i fe , t fc _i) - *fc-i)) eRl' e (t k ^) . 



Applying f|7.1|) yields that 

sll*ao(**,tfc-i) - $ a (tfc,tfc-i)|| < 2||g(ao,a;ao(tfc-i)) - §(«,a;a(tfe-i))|| < K\\a - a \\. 
Assumption (S3) ensures that (t,a) — > ^(a,x a (t)) is uniformly continuous on [0, T] x A' a , 

and (|2.6p that sup ||-Rg' e (t)|| is bounded in probability under Pg . The proof is achieved 

te[o,T] 

setting n k = ® a °^ k '\^-to\\ tk ' tk ~^ ' ^lo^k-i) and noting that i?^ e (i fe _i) is F tk _ 1 -measurable. 

□ 



The following Lemma concerns the properties of N k (X, ao) 
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Lemma 4.3. Assume (S1)-(S3). Then, under ¥g , 

N k (X,a ) = ea^Xt^) {B tk - fl tfc _i) + E k , 
where E k = E k (a ,Po) satisfies, for m > 2, E [\\E k \\ m \ JF tk _A < Ce m A m . 



The proof of Lemma 14.31 follows the proof of [GLO09I ] and is given in Appendix 17.51 



The properties of the derivatives of N k (X,a) are given in the following Lemma. 

Lemma 4.4. Assume (S1)-(S3). Then, for all i,j, 1 < i,j < a, as e, A — > 

(i) 7£ dN dai ( a o) = ~^( a o,gaofa-l)) + e(k,i + n,i where 

Ck,i = Cfc,i( a 0)e, A) is J r tk _ 1 -measurable and satisfies that sup \\Ck,i\\ ^ s bounded 

fce{i,..,n} 

in ¥g a -probability as e, A — > 0, and r k ^ = r k ^(a.o, A) is deterministic and satisfies 

sup \\r k j|| — > 0. 
fee{i,..,n} ' 

5 ^ da ^da 01 ^ ( a o) ^ s bounded in¥g -probability. 

Proof. Let us first prove (i). Using (|2.8p and (|4.4p . we get 

^T^(«o) = -%g^(«o) + ^gf^H) - [*«„(**, t*-i) " / P ] ^^(«o) 

Set C*,< = ^ ^"fa^-i) (a )-R^ e fa-i). Using pS} and {L2]) we obtain, as A -> 0, that 

II 5 ^"tf' Mil < 2|| ga6(a g£*- l)) (ao,Xa (t fc -i))||, «*P ||<(t)|| is bounded in P, - 

te[o,T] 

probability. It remains to study the deterministic part 

Ek, = -%^(«o) + ^#^(ao) - [*«o(t*,t*-i) " /J ^^(«o). 
According to (JT7TJ) and (|73|) . as A ->■ 0, 

i(*«o(tfc,tfc-l)-ip) (resp. s ~ ^ aar'O ( Q o) ) is approximated by 

g(a ,Xa (t fc _i)) (resp. - ^-i)) ^) ). Noting that 

^ ( t)) M = || (ao)XQo(t)) ; £ (a0)XQo(t)) aM*) (ao)) we get that 

Sfci = _ 9b(a,xa{t k -i)) / \ -i rrki w ith sup \\r k i\\ — > 0, which achieves the proof. The 

1 ' fce{i,..,n} ' 

proof of (ii) is given in Appendix 17.61 □ 



4-2. Study of the contrast process U e> A 

First, consider the estimation of parameters present in the drift coefficient. Using (|4.3p . 
we define 

Ki(ao, a; (3) = / T(ao,a;t)Y,~ 1 (f3,x ao (t))T(ao,a;t)dt. (4.5) 



Jo 

Note that K\ is non negative and by (S4) , if a ^ olq, the function T(ao, a, .) is non identically 
null. Thus, Ki(oto, a, j3) is equal to if and only if a = cto, and defines a contrast function 
for all p. 

Proposition 4.1. Assume (S1)-(S4). Then, as e — > and A — > 0, under ¥g , using 
definition (14. ip for £/ e A 
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(i) sup\e 2 (J7 £i a(«) 0) — U 6j ^(ao, /?)) — Ki(ao, a; /3)| — > in probability; 

6»G0 

(H) &e,A a® in probability. 

Proof. Let us prove (i). Using (|4.ip .we get 

n 

e 2 (t/ e ,A(«,/3) - Uc,A{ato,Pj) = s£ 1^fc(X, a ) - ^( X > ao)]£ _1 G9, Xt fc _! ) [JV fc (X, a) + 7V fe (X, <*„)] , 

fc=i 

Using that N k (x ao , ao) = 0, an application of Lemma 14.21 yields 

e 2 {U, A {a,P)-U cA {^,P)) = A^ ^^'^ S- 1 ^,^^!)) ^^ '^ + %,a,^;e,A). 

fc=i 

The first term of the above formula is a Riemann sum which converges by Lemma f4.1l to the 
function K\{olq, a, ft) defined in (|4.5p as A — > 0. This convergence is uniform with respect 
to the parameters. Let us now study the remainder term. Using Lemma 14.21 we get that 
R(a , a, (3; e, A) = T\ + T 2 + T 3 , where 



n t 



N k (x ao ,a) r „_i, fl v v ni N k (x ao ,a) 



Ti = W°o>"i [£ _1 (/3, Xt h _ x ) - Y.-\f},x aa {t k -i))] 



A 

k=l 



T 2 = Ae\\a - a ||^ %r) k , with V k = E" 1 ^, X^J ( ^^"^ + e\\a - a \\ Vk 

k=l 

n 

T 3 = 2Y, t VkN k (X,a ). 

k=l 

Using Lemma 14. II yields 

|Ti| < 2nA sup ||r(a ,a;t)|| sup ||S _1 (/3, Xj^J - S -1 ^, x ao (t fe _i))||. By Taylor 

te[0,T],a€K a /3eK b 

stochastic formula this supremum goes to zero in ¥g -probability as e — > 0. The term T 2 
contains the random variables r\ k and V k which are uniformly bounded in ¥g -probability 

by Lemma 14.21 Hence \T 2 \ < eT sup r/ k sup \\V k \\ which yields that T 2 goes to 

ke{l,..,n},aeK a fce{l,..,n} 
as e, A — > 0. Finally, we prove that T3 goes to zero in Pg -probability, by setting the more 
general result: 



if sup ||Vy<oo, VV fc iV fc (X,ao) — > 0, in P^-probability. (4.6) 
ke{i,..,n} 

Indeed, Lemma I4TB1 yields 



||E[V fc JV fc (Jr ) a )|.7 7 t fc _ 1 ]|| = llVfeE^fclJVjII < sup HUfcllJEfll^PI^J < CAe. 

ke{l,..,n} v 

Using (USD in Appendix [73] yields ||E [( %N k (X, a Q )) 2 \T th _ 1 ] \\ < CAe 2 . 

Set X n:k = t V k N k (X, ao). We get (|4.6p using an application of Lemma 9 in GEN93I ] (Lemma 



17.21 in Appendix). All convergences above are uniform with respect to 9 and the proof of (i) 
is achieved. 

Let us now prove (ii). The uniformity with respect to a in (i) ensures that the continuity 
modulus of U e> A is dominated, as e, A — > 0, by the continuity modulus of K\. By compacity 
of K a , we can extract a sub-sequence of d e ,A, (&e k ,Ak)k>i w hh a efe ,A fc — > Ooo £ K a . 
Then, by definition (|4.2p of a £i A, < Ki(ao,a oc , /3) < K\(o,q, cxq, /3), which yields, by (S4), 
0.00 = cyq. So any convergent subsequence of a €j A goes to ao which achieves the proof. □ 

The following Proposition studies the tightness of e" 1 (a e ,A — ao) with respect to /3 
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Proposition 4.2. Assume (S1)-(S4). If Ib(ao,Po) defined in (|3.9p is invertible, 
as e, A — > 0, sup ||e _1 (a e a — a o)|| * s bounded in Pg -probability. 

Using definition (|3.9p for the proof given in Appendix 1 7. 71 relies on the two properties, 
for all G K b 

V(i,j) e {l,.., a } 2 ,e 2 |-^(ao,/3) — > 2J 6 (a ,/3)ij , (4.7) 
aajftj e,A->o 

e ^^l AA(0,4/ b (a ,/3)). (4.8) 
For studying the estimation of j3, let us define 

K 2 (a ;Po,P) = ±[ Tr(S- 1 (/3,x ao (t))E(/3o,x ao (t))^ 



o 



T 

i / logde^E"^ WOWo^aoW)) dt-p. 
o 



(4.9) 



Using the inequality for invertible symmetric p x p matrices A, Tr(A)—p — \og(det(A)) > 0, 
with equality if and only if A = I p , we get that, for all p, K 2 (oeo; Po, P) is non nega- 
tive and is equal to if, for all t, T,(Po,x ao (t)) = E(/3, x ao (t)), which implies P = Po by (S5). 

Proposition 4.3. Assume (S1)-(S5). Then, if Ib(ao, Po) is invertible, the following holds 
in P 9o -probability, using P~T|) . ([43]) and ^9\i 

(i)sup\± (UA,e{&e,A> P) ~ ^A, e (a e ,A, Po)) - K 2 (a ] Po, P)\ — ^ 



(H) Pe,A — > Po- 

e,A->0 



e,A^0 



Proof. Let us first prove (i). Using (|4.ip .we get 

±(U A>e (a,P)-U Aje (a,p )) =A 1 {p ,P) + A 2 {a,p ,P) with 



1 n 

A^P) = -Y,log{det [E^X^JE-Vo,^)]) , (4.10) 



n 

k=l 



1 n 

A 2 (a,p ,P) = ^^^(I,a) [S-^A^J-r 1 ^,!^,)] N k (X,a). (4.11) 



fc=i 



Using that, under (S2), x — > log (deb [E(/3, x)E 1 (/3o,x)]) is differentiable on {7, an 
application of the Taylor stochastic formula yields 



At(p ,P) = i ^AY)og{det [E(/3,x Qo (t fc _ 1 ))E- 1 (/3 ,x Qo (^_i))]) + ei^AA^-Oj 

with 1 1 -^(L'o /3o 1 1 un ifo rrm y bounded in Pg probability. Hence, Ai(Po,P), as a Riemann 
i-T 

ges to ^ / /og( (det [E(/3, Xo, (i))E 1 (/3o, x Q0 (t))] ) dt ase,A— > 0. 
Jo 



sum, conver 

Applying Lemma I4T31 to N^{X, ckq) yields 
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A 2 (a ,A),/3) = ^Y. l U k M k U k + (S" 1 ^, X tk _J - TT^faXt^)) E k 

k=l k=l 

with [4 = ^ {B tk - B^) andM fe = l a(P , X^) (ST 1 ^,^) - Tr^X^)) a((3 , X^] 
The random vectors U k are Af(0,I p ) independant of Tt h _ x - Hence, using that for 
U~M(Q t I p ) E{HJMU) = Tr(M), we get 

E[fyM fc l7 fc |.F tfc _ 1 ] = Tr ( M k) = Tr (Z-^frXt^ZifaXt^) - I p ). The first term of 

^("Oj Ah /3) converges to ^ / Tr(£ 1 (/3,x Q , (t))E(/?o,x ao (t))df — p. Joining this result 

Jo 

with the one for A\ , we obtain consistency towards K 2 defined in (|4.9[) . The detailed proofs 
for consistency of A 2 (ao, (3q, j3) and control of the error term A 2 (a e ^, (3 , P) ~ A 2 (ao, PoiP) 
are given in Appendix 17.81 

The proof of (ii) is a repetition of the one of Proposition 14. 11 - (ii) . □ 
Let us now study the asymptotic properties of our estimators. Define the 6x6 matrix 



i r 



ds. (4.12) 



Theorem 4.1. Assume (S1)-(S5). // I b (a , /3 ), I a (a , f3 ) defined in (j4TT2|) are 

invertible, we have under ¥g , in distribution 

e" 1 (a e , A - a )\ f ( //^(ao, A)) \ 

We have already studied the limits as e, A -»• of e 2 ^-(«o, A)) and e d0t( Q°' M in 
Lemma 14.21 These results lead to a £j A asymptotic normality. For /3 ej A we have to set that 
-jE^if( o) -> AA(O,4/ CT (0 O )) in distribution and £|£g£(0o) -> 2I tr (0o)ij in probability. 

Finally, for crossed-terms it is sufficient to prove that q^.Qq . (#o) — ^ in probability. 
Details are provided in Appendix 17.91 



5. Examples 

5.L Exact calculations on Cox-Ingersoll-Ross model (CIR) 

Consider the diffusion on M + defined for a > by 

dX t = aXtdt + el3\fXtdB t , Xq = xq- 

We have 6(a,x) = ax, a(/3,x) = f3i/x and x a (t) = xoe at . The function $ Q define in 
d23H is explicit with $ a (t 2> *i) = e^* 2 "* 1 ). £(/3,x) = /3 2 x and S?'^ = x /3 2 ^Fie"^ 



'fc - A ^ aA 

depends on (contrary to the Ornstein-Uhlenbeck process in Section I3.ip . This is an 

AR(1) process, but the noise is not homoscedastic. Let us define a = e a ^. We have then 
/ n \ / n \ 

a £i A = I '^2^t k Xt k _ 1 J / I y^X f 2 fc i J . With notations introduced in previous sections for 

\fc=i / \fe=l / 
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the different estimators, we have for f|3.2[) : a e ,A = \ln{a). No explicit formula can be 
obtained for d £i A and a £i A defined in (|3.1ip and (|4.2p . 



We can also calculate the asymptotic covariance matrix (j3.9|) : Ib(a, (3) = x °^ a ^ - Noting 
that D fc (o) = Ae QfcA , we get for (1X13]) : I A (a,/3) = J 6 (a,/3) x a. Setting J b (a,/3) = 

£^ ij ^f, we obtain for (32]): J A («,/3) = J b (a,P)x% (fe 1 ) (S^)'- We remark that 
Jb(a,(3) < Ib(a, 0), VT > 0. So, as expected, VA > 0, J&.{u,0) < l&{a,0). Hence, contrast 
estimation with prior knowledge on the model multiplicativity (see Section f3.3.2|) leads to a 
more accurate confidence interval than the general case with no available information on 0. 



5.2. A two factor model 



We consider here the same example as IGLOOS (see e.g. |L0N95^ . Let us define 
X t = (Y t , Rt) as the solution on [0, 1] of 



dY t = {R t + /ii) dt + e Kl dB}, Y = y eWL 

dR t = p 2 (m- R t ) dt + eK 2 VRt (pdBl + y/l - p 2 dB^j , R = r > 0. 



(5.1) 



fRi 



Hence, we get that S((«i, K 2 ,p), (Y t ,R t )) = ( Kl r-^- ^^P^ 1 

Kt K, 2 ri 

For tq ^ m the diffusion process satisfies (S1)-(S5) and we can estimate parameters a = 
(Mi> P2, m) and = (k? , k 2 , p) with our minimum contrast estimators defined in (|3.2|) . (|3.1ip 
and ()4.2p . As [GLO09I ]. we investigate the case of p\ = p 2 = m = k\ = k 2 = 1, p = 0.3, 
and (yo^o) = (0, 1.5), for two values of e, 0.1 and 0.01. Similarly, we present in Tables I5T21 
and 15.21 contrast estimators (empirical means and standard deviations) over 400 runs of the 
diffusion process (|5.ip simulated based on an Euler scheme. For each of these simulations, 
different values of the number of observations n are used (n = 10, 20, 50, 100 observations) 
to infer parameters. Results of Gloter and Sorensen were reproduced usin g their c ontrast 
based on an expansion at order 2 of the function defined in S ection 2. 3. in GLO09I ]. 



GLO09I ]. When e = 0.1, we 



For e = 0.01, Table 15.21 results are very similar to those in 
can distinguish two different pattern s. For a and (a, (3) results exhibit a lack of accuracy 
on //2, similarly to those of [GLO09I ]. The second pattern concerns a, where the bias on 
p 2 (for n = 10,20,50), less important than for a and (a,/3), is partially balanced by an 
increase in the uncertainty of m. These results show that prior knowledge on the model 
(more specifically fixing the diffusion parameters to their true value) leads to a different 
behaviour of the estimator. From a theoretical point of vue, this is explained only by the 
shape of S^^°, which does not consider equal weights for all observations. 
In addition, a decrease in accuracy is obtained when increasing the number of observations. 
This could be explained by the behaviour of N^{X, a) which depends on the variation of 
slope between two consecutive data points. Indeed, in this particular model (|5.ip . where 
the drift is almost linear (and hence the local gradient close to zero), variations of local 
slopes increase with the number of observations randomly distributed around the global 
slope. When performing the estimation on a longer time interval with the same number of 
observations ([0,5], n = 50), the decrease in accuracy following the increase in the number 
of observations is partially counterbalanced. 
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e = 0.01 


a (/? unknown) 


a (/3 = fa fixed) 




( 


a, $) (Small Delta) 




n 


Mi 


M2 


m 


Mi 


M2 


rh 


Mi 


M2 


rh 


hi 


hi 


P 


n=10 


1.001 


1.007 


0.996 


1.000 


1.005 


0.997 


1.000 


1.013 


0.999 


0.971 


0.728 


0.328 




(0.01) 


(0.13) 


(0.04) 


(0.01) 


(0.12) 


(0.04) 


(0.01) 


(0.13) 


(0.04) 


(0.45) 


(0.37) 


(0.33) 


n=20 


1.000 


1.012 


0.999 


1.000 


1.003 


0.997 


1.000 


1.012 


0.999 


0.973 


0.853 


0.306 




(0.01) 


(0.12) 


(0.04) 


(0.01) 


(0.12) 


(0.04) 


(0.01) 


(0.12) 


(0.04) 


(0.32) 


(0.29) 


(0.23) 


n=50 


1.000 


1.012 


0.999 


1.000 


1.000 


0.996 


1.000 


1.012 


0.999 


0.982 


0.910 


0.302 




(0.01) 


(0.12) 


(0.04) 


(0.01) 


(0.12) 


(0.04) 


(0.01) 


(0.12) 


(0.04) 


(0.20) 


(0.19) 


(0.14) 


n=100 


1.000 


1.013 


0.999 


1.000 


0.995 


0.995 


1.000 


1.011 


0.999 


1.001 


0.953 


0.310 




(0.01) 


(0.12) 


(0.04) 


(0.01) 


(0.12) 


(0.04) 


(0.01) 


(0.12) 


(0.04) 


(0.14) 


(0.14) 


(0.09) 



Table 1: Mean (standard deviation) of minimum contrast estimators for parameters of (|5.1fl based on 400 
simulated trajectories with fj,i = [i% = m = Ki = «2 = 1, p = 0.3, using e = 0.01 and n = 10, 20, 50, 100. 



e = 0.1 


a (/3 unknown) 


a{P = Po fixed) 




( 


5, $) (Small Delta) 




n 


Mi 


M2 


m 


Mi 


M2 


rh 


Mi 


M2 


rh 


*? 


ki 


P 


n=10 


1.000 


1.723 


0.892 


1.005 


1.052 


0.667 


0.998 


1.678 


0.997 


0.927 


0.769 


0.422 




(0.10) 


(1.23) 


(0.43) 


(0.10) 


(0.92) 


(0.49) 


(0.10) 


(1.23) 


(0.41) 


(0.43) 


(0.36) 


(0.23) 


n=20 


1.001 


1.754 


0.922 


1.011 


0.930 


0.590 


1.000 


1.718 


0.930 


0.966 


0.864 


0.344 




(0.10) 


(1.24) 


(0.40) 


(0.10) 


(0.90) 


(0.51) 


(0.10) 


(1.20) 


(0.39) 


(0.29) 


(0.29) 


(0.18) 


n=50 


1.000 


1.760 


0.928 


1.029 


0.509 


0.342 


1.001 


1.82 


0.994 


0.971 


0.832 


0.167 




(0.10) 


(1.23) 


(0.40) 


(0.10) 


(0.70) 


(0.61) 


(0.10) 


(1.18) 


(0.31) 


(0.09) 


(0.08) 


(0.07) 


n=100 


1.001 


1.778 


0.933 


1.051 


0.122 


0.410 


1.000 


1.825 


0.987 


0.979 


0.846 


0.156 




(0.10) 


(1.23) 


(0.40) 


(0.10) 


(0.27) 


(1.22) 


(0.10) 


(1.19) 


(0.33) 


(0.07) 


(0.06) 


(0.05) 



Table 2: Mean (standard deviation) of minimum contrast estimators for parameters of (|5.1|l based on 400 
simulated trajectories with fj,i = 112 = m = Ki = K2 = 1, p = 0.3, using e = 0.1 and n = 10, 20, 50, 100. 



5.3. Epidemic models and data 

Here, we present an example where e, corresponding to the normalizing constant 1/yN 
has an intrinsic meaning. One of the simplest models for the study of epidemic spread 
is the SIR (Susceptible-Infectious-Removed from the infectious chain) model, where each 
individual can find himself at a given time in one of these three mutually exclusive health 
states. 

One classical representation of the SIR model in closed population is the bi-dimensional 
continuous-time Markovian jump process: Xf = (St, It) with initial state Xq = (N — m,m) 

and transitions (S, I) — > (S — 1, 1 + 1) and (S, I) -^U (S, I — 1). 

The normalization of this process based on the population size N asymptotically leads to 
an ODE system: x(t) = (s(t),i(t),r(t) = 1 - s(t) - i(t)), with x(0) = (1 - m/N,m/N,0), 

— \X\X2 
\X\X2 - JX 2/ 

Before passing to the limit, by defining S((A,7),x) = ( ^ lX2 \x\X2 1 ^ 



which is solution of (|2.3p for b((X,j),x) 



-\xiX 2 \XIX 2 +7X2; 

write the infinitesimal generator of the renormalized Markovian jump process (X(t)/N) as 
the solution of 

An (fix)) = N\x!x 2 (f(xi -jf,x 2 + jf)- f(xi,x 2 )) +N-yx 2 (f(xi,x 2 - w)~ f( x -L> x 2))- We also have 
A N (f(x)) = A ( P (f(x)) + A% +) (f{x)), with 

•^n Ui x )) = b ((\l), x ) V f(x) + 2F ^2 dxdx^ X ^^ X,1 ^ X ^' j Where contains 

i,j=l 1 3 

all the derivatives terms of order 3 and above. Then, approximating the renormalized 
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(2) 

Markovian jump process by a Markov process with generator A N , leads to a diffusion 
process Xt = (st,it) with drift b and diffusion matrix S, which can be rewritten as the 
solution of: 

ds t = -Xs t i t dt + -^yJ\s t itdBi(t) 

(5.2) 

di t = (Xs t i t - lk)dt - -^y/Xs^dB^t) + -^y/yT t dB 2 {t). 

Here, A and 7 represent transmission and recovery rates, respectively and are the two pa- 
rameters to be estimated. 

So, system (|5.2|) can naturally be viewed as a diffusion with a small diffusion coefficient 
( e = AT- 1/2). Mor 

eover, parameters to be estimated are both in drift and diffusion coeffi- 
cients, with the specificity that a = (A, 7) = j3 (with the notations of (|2.ip ). Besides, since 
epidemics are discretely observed, the statistical setting is defined by data on a fixed interval 
[0, T], at times = kA, with T = nA and n the number of data points (A not necessarily 
small) . 

The performances of our method for epidemic models in the case of a fixed sampling inter- 
val A and for A — > were evaluated on discretized exact simulated trajectories of the pur e 
Markov jump process Xt and compared to estimators provid ed by th e method of GLO09| |. 



We considered the Maximum Likelihood Estimator (MLE) [ANDOoI ] of the Markov Jump 
process, built using all t he jump s, as the reference. Simulated data were generated by using 
the Gillespie algorithm [GIL77 ] after specifying (N, m,A,7). Two population sizes were 



considered N 6 [100; 10000] and m/N was set to 0.01 for all simulations. (A, 7) were chosen 
such that their ratio takes a realistic value. Indeed, A/7 defines for the SIR model used 
here a key parameter in epidemiology, Rq, which represents the mean number of secondary 
infections generated by a primary case in a totally susceptible population. We have chosen 
Ro = 1.2,7 = 1/3 {days' 1 ) (and hence A = 0.4 (days' 1 )) to represent a realistic scenario 
(parameter values close to influenza epidemics). We considered T = 50 days, in order to 
capture the pattern of an average trajectory for the case iV = 100 (shorter epidemic dura- 
tion than for N = 10000). Three values of n were tested: 10,50, 100. For each simulated 
scenario, means and theoretical confidence intervals (95%) for A and 7 were calculated on 
1000 runs for each parameter and for each estimation method. 

Figures [U and [2] summarize numerical results (only drift estimators are provided). Accord- 
ing to our findings, contrast based estimators are very effective even for a few amount of 
observations, compared with the MLE. As expected, for all scenarios, we can see an im- 
provement in th e accurac y as the number of observations increases for estimators a, a and 



the estimator of [GLO09I ]. On the contrary, a-estimators accuracy decreases as the number 
of observations increases. This phenomenon is due to the shape of S^'^ 3 (defined in ([2. lip ), 
which confers greater weigths to the beginning and the end of data (as for the two factor 
model ()5.ip above). For A~ = 10000, it is important to notice that the bias is quite negligible 
from an epidemiological point of view. Indeed, the bias for I/7 has an order of magnitude 
of one hour whereas an accuracy of one day would be acceptable. For the case N = 100, 
only emerging trajectories were considered, based on an epidemiological relevant criteria 
(epidemic size above 10% of the population size). We can remark that MLE provides less 
satisfactory estimations for 7. Our contrast estimators for n = 100 perform globally well, 
except for a. But even in this last case, contrary to the MLE, the ratio A/7 is close to the 
true value despite a bias on both A and 7 separately. 
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Number of observations 



Figure 1: Mean values and theoretical CI (95%) of the estimators of A and 7. Labels are 0: MLE (with all 
data available), 1: a (ft unk nown), 2: a (/3o known), 3: a. {fi = a) , 4: a (small A) and 5: the estimator of 
drift parameters in |GLO09l |'). Results based on 1000 runs for N = 10000, T = 50, A = 0.4, 7 = 1/3 and for 
n = 10,50, 100. 
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Figure 2: Mean values and theoretical CI (95%) of the estimators of A and 7. Labels are 0: MLE (with all 
data available), 1: a (P unk nown), 2: a (/3o known), 3: a. {fi = a) , 4: d (small A) and 5: the estimator of 
drift parameters in [GLO09I ]'). Results based on 1000 runs for N = 100, T = 50, A = 0.4, 7 = 1/3 and for 
n = 10,50, 100. 

Our results are promising in the epidemiological context, since the minimum contrast 
estimators are both accurate and not computationally expensive, even for very noisy data 
(N = 100). Ongoing research is devoted to the extension of these findings to the more 
realistic case of partially observed epidemic data. 
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7. Appendix 



7.1. Some useful analytical properties 

We state here a series of regularity properties of (a, t) — > <3? a (i, to) and x a (t). Let us 
first consider <E> a . A Taylor expansion of t — > & a (t,t k -i) yields using ([27 



®a(tk,tk-i) = ^ P +A||(a,x Q (tfc_i)+Ar(cM fc _i, A) where r(a, t k -i, A) converges uniformly 
to on [0,T] x K a . Hence, 

1 C^fe 

I- ($ Q (i fc ,i fc _i) - I p ) - — (a,x Q (t fc _i))| — > 0. (7.1) 
A ox A->o 

As a consequence (a, t) — > <fr a (t,to) is uniformly bounded on K a x [0, T\. Consider now the 
properties of a — > & a (t,to). 

Lemma 7.1. Under the assumption that b(a,x) £ C s (K a x U), the function a — > & a (t,to) 
is in C 2 {K a ). 

Proof. Classically, we just prove here that <5 Q is continuous w.r.t. a if a — > ^(a, x a (t)) is 

continuous. Set Mh(t) = & a +h(t>to) — &a(t,to). Using (|2,4p . we have 

/"* db h ( db db \ 

M h (t)= I — (a+h,x a+h (s))M h (s)ds+ \,—(a + h,x a+h (s))-—(a,x a (s))J$ a (s,t )ds. 

By (17. ip and the continuity of t — > & a (t,to), we can define Kq = sup \\$ a (t,to)\\ and 

K a x[0,T] 

/"* db db 
K = sup || (a, x a (t))\\. Setting j h (t) = K I \\ — (a + h,x a+h (s))-—(a,x a (s))\\ds, 
K a x[o,T] ax J t dx dx 

we have ||M/j(i)|| < 7/ l (t) + i^ / ||Mh(,s)||ds. Applying GronwalFs inequality to ||M^|| yields 

t 

\\M h {t)\\ < j h (t) + K ^ h {s)e K{t - s) ds. By the Lebesgue dominated convergence theorem, 
■Jt 

'jh (t) goes to as h — > 0, which implies the same property for Mh(t). □ 

The existence of derivatives for $ Q w.r.t. a are obtained similarly. Moreover, expanding 
^f' dltfaj in Ta y lor series at P° int *fc-i> th -ey satisfy 

Vz<a, |- — (a ) ^ (ao.XaoCtfc-OJI^O, (7.2) 

w . i 1 ° 2 ®<x(h,h-i) , v (9 3 b(a,x a (t fc ,i)) 

Vz,j<a, — — — (a ) a a a a ,s QO fe-i — ► 7.3 

and all left terms are bounded as A — > 0. 

Let us now consider x a and its derivatives. Using (12. 3|) . and expanding t —> x a (t) in Taylor 
series at point t k -\, as above, yields 

|-r (za(*ft) - ^a(4-i)) - K^^a^it-i))! — ► 0, (7.4) 
Z\ A->0 

1 ( dx a (t k ) dx a (t k _i)\ db(a,x a (t k _i)) 

(«o) a ( a o)| — ► 0, (7.5) 



1 (d 2 x a (t k ) d 2 x a (t k _ 1 ) \ db(a,x a (t k -i)) 

(«o) q — q («o) 5 — 5 ("oil — (7.6) 



A V daidctj dcxidaj J dotidotj a^o 
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The proof of (i) is given in [FREI84I ] (Theorem 2.2) but we need a more refined result on 
the increments of Rg' e (t). For sake of clarity, we omit in the sequel 9 and a (and therefore 

denote ^(xq) by f'(xo)), and we denote by |||| either a norm on W p or on M P (R). We study 

successively Rg' £ (t) and R 2 g ^(t). 

Using Xf = x(t) + e22 1,e (t) and (l2T|) . i^' e satisfies 

fl 1 . 6 (t)= f -(b{x(s) + eR 1 ' e {s))-b{x(s)))ds + [ a(x(s) + eR 1 ' e (s))dB s , R 1 ' e (0) = 0. 

Jo e Jo 
Hence, iZ 1,e satisfies a stochastic differential equation with drift d e (t,z) and diffusion 

coefficient v € (t, z) where d e (t, z) = - (b(x(t) + z) — b(x(t))) and v e (t, z) = a(x(t) + ez). 

Using that the derivatives of b and a are uniformly bounded on U, these two coefficients 

satisfy 

\\d e (t, z) — d e (t, z')\\ < sup\\b'(x)\\ \\z — z'\\, \\v e (t, z) — v e (t, z')\\ < sup\\a' (x)\\e\\z — z'\\ 
and \\d e (t, z)\\ 2 + \\v e (t, z)\\ 2 < Ci(l + \\z\\ 2 ) where C\ = max(sup\\b' (x)\\, sup\\a' (x)\\). 

x&J x&U 

Hence, X\ is finite and independent of e. An application of Theorem 2.9 of 



KAR91p yields that there is C a constant depending only on C\ and T such that 



Vt < T, E [Hi? 1 - 6 ^)!! 2 ] < Ce ct . 
Let us now study R 2 ' € (t). Using (|2.6p we get, 

R 2 > e (t)= [ d e (s,uj,R 2 ' e (s))ds+ [ v e (s,Lj)dB 8 , R 2 ' € {0) = 0, (7.7) 
Jo Jo 

with d e (s,u,z) = ±r (b{x(t) + eg{t,u) + e 2 z) - b(x(t)) - eb'(x(t))g(t,u)), 
v e (s,J) = \ (a(x(t) + eR l > e {t,uj)) - a(x{t))) . 

First, let us check that the stochastic integral above is well defined. For this, we 
compute E || / v e (s, ui) t; v € (s, ui)ds\ 



v e [s, UJ 
and 
E 



Applying a Taylor expansion to a(x(s)) yields 
f a'(x(s) + ueR 1 ' e (s))dv) R 1 ' e (s). Hence, ||S e (s,w)|| < «ip||<J / (x)||||il 1 ' e (s)|| 

Jo J xeU 

r t -i r t Ct _ i 

/ v £ {s,u)%{s,u)ds\\ <sup\\a'(x)\\ 2 E [||i? 1,e (s)|| 2 ] ds < C 2 ——-, — . 

Jo J x&U Jo ^ 



Consider now the drift term d e (s,u,z). A Taylor expansion with integral remainder yields 

d e (s, uj, z) = (b(x(t) + eg{t) + e 2 z) - b(x(t) + eg(t)) + (b(x(t) + eg(t)) - b(x(t)) - eb'(x(t))g(t)) 

= e 2 ( [ b'{x{t) + eg{t) + ue 2 z)du J z + e 2 *g(t) ( [ (1 - u)b"(x(t) + ueg(t))du J ^(t). 



r 2 



Hence, d e (s,u), z) is bounded independently of e by 

||d e (s,a;,z)|| < stip6'(x)||z|| + snp||6"(x)|| ||g(i)|| 2 . Now, using (|7.7p . we get 

x£U x£U 

||i? 2 ' e (t)|| 2 < 2 (\\J de(s,u;,R 2 > e (s))ds\\ 2 + || J v e (s,u)dB s \\ 2S j . We already prove that the 

last term above has a finite expectation. It remains to study the first term. 

E [Hi? 2 '^)!! 2 ] < 2Cf [ E [||i? 2 ' e (s)|| 2 ] ds + H(t) with 
J o 

rt Ct _ 1 

H(t) = 2sup\\b" (x)\\ 2 / E [||5(t)|| 4 ] ds + C\ — - — . Applying Gronwall's inequality yields 
x&J Jo C 
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E [||i? 2 ' e 0)|| 2 ] < H(t) + 2C\ [ H(s)e 2C ^ t - s) ds. Since g(s) is a continous Gaussian 

J o 

process, sup E[|| 5 (s)|| 4 ] is finite, and \H(t)\ < Kt, so that E [||i? 2 ' € (t)|| 2 ] < K't with 
se[o,T] 

if' = K(l + 2Cf). 

/t+h rt+h 
d e (s,oo,R 2 ' e (s))ds + / v e (s,cj)dB s . 

E [||i? 2 > £ (i + fc) - ^(t)|| 2 ] = E [E [\\R 2 >*(t + h) - R 2 >%t)\\ 2 ] \JF t ] = E[E Xt [||^ 2 ' e W|| 2 ]] . 
By the Markov property of X t we get that E Xt [||i? 2 ' e (/i)|| 2 1 < K'h. 



7. 3. Proof of Proposition HO 

Let us first prove (i). The processes U eA (ot, (Xt k (uj))) are almost surely continuous with 
continuity modulus 

w(U eA ,rj) = sup{\U eA (a, •) - U eA (a' , •)!,(«,«') G K 2 , \\a - a'\\ < r]}. 

We have \U eA (a,_(X tk )) - U eA {a>, (X t _ k ))\ < \U eA (a, (X tk )) - U eA (a, (x ao (t k )))\ + 

\U eA (a', (X tk )) - U eA (a', (x ao (t k )))\ + \Ka{olq,o) - K A (a ,a')\. 

Using formula (g^]), N k (X,a) - N k (x ao ,a) = eR l e ^{t k ) - $ a {t k , t fc _i)ei?^ e (i fc _i) and 

n 

\U eA (a,(X th )) - U eA {a,(x ao (t k )))\ < ^J2\\N k (X, a) - N k {x ao , a)\\ \\N k (x ao , a) + N k {X, a)\\ 

k=l 

< ^ sup \\eR e ' o e (t)\\ sup \\I P + $a(t k ,t k -i)\\\\N k (x ao ,a) 

t6[0,T] a&K a ,k£{l,..,n} 

Let (f){rj) = sup{\Ka{oiq, a) — Ka{oiq, a')|, (a, a') G K 2 , \\a — a'\\ < rj}, we obtain 
w(U t A,v) — > 't'i'n) under Fg . Assumptions (S1)-(S4) ensure that </>(r/) — > 0. The proof 

of (i) is achieved using Theorem 3.2.8 ( DAC93J). 



Consider now the second derivatives of U e (a, .). Noting that, 



we have 



+^&^ il («o) [Xt^ - x ao (t k ^)] 

d 2 u f 



daidctj 



(a ) = 2eVAE 1 + 2AE 2 , 



with Eh. = J2a * * »o -=JV fc (X,ao : 



fe=i 
ra /it 



2 = ^ I A ^ (ao) A ft,- M 



fc=i 



v 



Using (|3.7p . (|3.8p and that ^ - Q^^da^ (qq) is bounded in probability, yields that £i and 



are bounded in probability. Hence, J^.q^. (qo) — >• 2A^^ t D k i(ao)D k j(ao) = 2MA(oco)i,j- 

k=l 

The consistency result obtained in (i) , and the uniform continuity of a — > & a and its deriva- 

\^(a +t{a eA -a ))-^(^jii 

te[o,i] 

which completes the proof of (ii). 



tives (see Lemma l7.ip . yields that, under Pa . sup \\^%- (ao+t(a eA — «o)) — ^^(^o)!! — > 0, 
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7-4- Proof of Proposition \3.2\ 



The proof of (i) is a repetition of the proof of Proposition 13. 1[ The proof of (ii) contains 

( 



additionnal terms due to the presence of S^'^ a ^ in the contrast process: 
- l9U ff(-o) = Ti+e T|, with r| = 2VA£ ( (S^T 1 ( ^pl) , 



doii 



k=l 







(W(«))-i 



(«o) 



, A 

k=l \ 

N k {X,a ) 



For all i, the term T| is bounded in probability since S^'^ inherits from $ a its differen- 
tiability with respect to a and that ^^=p^ is bounded in probability by (13, 7|) . Using now 
(ETT)) and (|5^]> . we obtain, as before, that (T{) 1<i<a — > AT (0, 4I A (o! , £ )). 

9 2 J/a, £ (qo,/3 : 



<e->0 



fc=l 



r* J + 2e\/Ar2 J + e 2 T3 J , where for all i, j < a: 

("o)> 



T ! j OA v^ 1 t dNj e (X 



da,- 



1 *9 2 iV fe (X,a) 



T 2 3a,-3a,- M(5fe } 



A day 

N k (X,a ) , 1 ^(^a). , 9 

7^= ^ a" 5 (aoj- 

eVA A dcti 



5 «j 



(«o) 



N k (X,a Q ) 
e^A 



J 3 



^*iV fc (X,a ) 92 
fc=i 



(s? /C«))-i 



eVA 



(«o) 



da.ida.j 
re 

<9 2 C/ Ae (ao,/3o) 



eVA 



The two terms T^ 3 and T3' 3 are bounded in probability and therefore 



dctidctj 



<E->0 



2J A (ao, Po)i,j 



7. 5. Proof of Lemma \4-S\ 

Proof. Let us study the term E k defined in Lemma 14.31 We have E k = + E% with 
rt k 

E k= (b(a ,X t ) -b(a ,x ao (t)))dt+ ( I p - $ Q0 (i fe , tjfc-i)) [X fc _! - (tjt_i)] and 
E k = e f k «fo,Xs) -o-%,X tk _,)) dB s . 



Using that x — > 6(a, x) is Lipschitz, we obtain 



\El\\ <AC sup \\X t - x ao (t)\\ +Ae\\ / — (a ,a;ao(*))*a (t,tfc-i)dt^; 6 (tfc-i)|| 
te[t*-i;tfc] Jo OT 



te[tfc_iit*] 



The proof for Ei follows the sketch given in [GLOOfll ] (Lemma 1). We prove this result 
based on the stronger condition £ and b bounded (similarly to Gloter and S0rensen in 
Proposition 1 |GLO0d | ). 

We use sequentially Burkholder-Davis-Gundy's inequality and Jensen's inequality to obtain 
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\ m/2 

\a{^X s )-a{^X tk _ 1 )fds\ \Tt k _, 



< Ce m A m /2-i / E [\\<t(Po,X b ) -a^Xt^mTt^] ds. 
Jt k -i 

Then, using that x — > x) is Lipschitz, we obtain: 

ECll^n^-J < C'e^A^t EiWXs-Xt^W^ds 

Jtk-i 



< c'e m A m / 2 ~ 1 / E 



(b(a , X u )du + ecr(/3o, X u )dB v 



ds. 



Since b is bounded on U, || / b(ao, X u )du\\ < K\s — t k -\\ and the Ito's isometry yields 

Jtk-i 



E 



a(p ,X u )dB u \ 



t k -i 



< E 



Y,(p Q ,X u )du\ 



t k -i 



m/2 



<K\s-t k ^l\ 



Thus, E l\\El(p Q )\\ m \F tk J < c"e m A m / 2 ~ 1 / |s - i fe _iP /2 ds < C (3) e m A m . 



□ 



The two following results are consequences of Lemma 14.31 Define, for M a symmetric 
positive random matrix, using (|2. 13j) : 



iV| )0 (M) = t N k (X,a )M^ 1 ^ ,X tk _ 1 )N k (X,a ) G R. (7.8) 

Now, for i = 1,2, if (M^ 1 )/ C >i is a sequence of l -measurable symmetric positive matrices 
sup\ 

k>l 

1 , 1 



of Mp(IR) satisfying sttpHM^^H is finite in probability, 
fc>i 

1 i n 



,A->0 







i n 



(7.9) 



— ► 

e,A->0 

(7.10) 



Indeed, under Pg , we have 
E [ t N k (X,a )M k 1 } 1 N k (X,a )\Tt k _ 1 ] = £ (M^\)ijE [N k (X, ao)iN k (X, ao)j\J r t k _ 1 ] 



where which leads to 
1 



<,3'=1 



< 



fc=l 

n 

fc=l 



c 



ke{i,..,n} 

< C'VA 

The proof of (|7.10p is similar and not detailled here. 
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7. 6. Proof of Lemma fa) 

Using (HH, we have ^ ^ (oo) = /^'W d \a ) with 

4 3 i (iJ ° = i^&^(«o) [Jrj x -zoofo-!)], and 

+ a ^ («o) — as" — («o) + ^ («o) — — (aoj ) ■ 

Using (|7.2p . (j7.3|) and (|7.6p we obtain that the deterministic quantity ||/^'^||oo is bounded 

(3) (i 7') 

as A — > 0. Finally, r]i_^ is J 7 ^^ -measurable and goes to zero due to Taylor's Stochastic 
formula as e, A — > 0. 



7. 7. Proof of Proposition 



Let us first recall Lemma 9 in [GEN93I ] that we use in the proof adapted to our notations. 
Lemma 7.2. Let (X^) be a Tt k -measurable random variable (with t^ = kT/n), then assume 

■ 0, then 



k\2\ T7 



that f-X'nl^i* _ 1 U , with U a random variable, and |^(X 
k=i fc=i 

n 

convergences are in probability. 

fc=i 

A Taylor expansion with integral remainder, for function 9 g^ A at point (ao, /3<e,a) yields 
for % < a 

dU etA (a , /3 £)A ) _sr-{ f 1 2 d2 Ue,A 



/f d U a \ 



don — KJ0 7 i - 



Then, setting 

V(°^0, Pe,A)i,j = ( f g 2 9 UeA (a + t(a e A ~ a ),$e,A) ~ ^ U ^ ( Q 0' &e,A) dt 

\J daiCtj daiOj 



+ e2 a^I^( a o,/3 e A) - 2 4(ao,,/3 e ,A)i,j, 



with Lb defined in (j3.9[) we get 

[2L b (a , Pe,a) + v( a o,$e,A)] e ~ 1 («e,A - «o) = -e ^ («o, /3 £ ,a)- (7.11) 

To obtain the tightness of the sequence e~ l {a e ^ — oiq) w.r.t. /3, we first study the right 
handside of (17TTO . 

Using LemmaP(i), for all t E {1, ..,a}, e du <M^M = ^ {C{ + D{), 

fc=l 

with q = f *^(a ,2 ;Qo (i fe _i))S- 1 (/3,x ao (t fe _ 1 ))iV ifc (X,Qo) and 

^fc = f g|(ao,Zao(**-i)) [S-H^^.J-S-HiS^aoCtjfe-i))] JVfc(^,ao) 
+2||a-a || t %S- 1 (/5,^ fc _ 1 )A r fe(^,ao). 
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Set Cf. = Cj, — E [C|| J r t fe _ 1 ] . Let us consider the centered martingale ^^Cfc- In order 

I 1 k=1 

to apply a central limit theorem (see HAL80I ] Theorem 3.2 p. 58) we have to prove that 

n 

sup\Ci\ — > O.VQCl — > 4f 6 (oo,i8)ij andE 



fe=l 



sup ( C 



< oo . 



Note that, since the limit Ib(ceo,/3) is deterministic, no nesting condition on the c-fields is 
required. 

Applying Taylor's stochastic formula to N k (X, «o) in C k expression yields 

C\ = 2^(a ,x Qo (t fc _i))S- 1 (/3,x ao (t fc _ 1 )) (VAZ^° + e{R 2 ^(t k ) - R 2 ^(t k ^))). Hence, 



suplCt — E \Ci\Ft, ,1| — ► and E 

n 



^(qt-E^iJi^]) 2 



< oo. It remains to 



prove that ^CfcCfc — > ^h( a 0i Let us apply Lemma 1731 with X n ^ = C\& k . 

k=i e ' 



Then, £e [&A\^ 

k=l 



1 



[JVfc,o( Af fc-i)l jr t fc -i] where ^1,0 is defined in (USD and 



M fc _! = 4 ^-(a ,x ao (i fc _i))S- 1 (/3,x QO (i fc _ 1 ))||(ao,x Qo (t fc _i)). Using yields that 



fc=l 

n 

£e [(CicJ) 2 !^,] = C(A) 0. 



4J 6 (ao,j0)i,-| — )• 0. Moreover (ETUI) leads to 



fc=l 



Now, we prove that the centering term ^~^E [C^|j c j fc _ 1 ] — > and ^^D\. -^-^ in prob- 



k=l 



k=l 



ability. For D l k , (|2.6p ensures that ||- [S 1 (/3,X ifc _ 1 ) — S 1 (/3, x Q0 (t K _i))] || is bounded in 
P# -probability. Hence, using Lemma [ 



VZ 



2 

doii 



(oo, xoo^-i)) [S- 1 ^, *t*_ a ) - x Q0 +2||a - a || V^'HA 



is bounded in probability for all k. Since ensures that 

n 

^£>1 in P eo -probability. 



fc=i 



Set V fc _i = S 1 (/3,a; Q , (t fe _i))^(a ,a; ao (t fe _i)). 
Then, E [C^Ji^J = ± V fc _iE [jV fc pf, o^l-TVi] • By the Taylor's stochastic formula 

N k (X,a ) = e^/AZ^ + e 2 fe(t fc ) - $ ao (i fe , i fe _i)i# e (i fe _i) 



+ A 



(/p-$c« (*fc,«fc-i)) p 2 



An 



Using that ^ is independant from J r t k _ 1 , 

E [Vfc-xiVfe^ao)!^,] = e 2 V fe _i [e [i?^(*fe) - <, e fe-i) 
Abel transformation to the series yields 

|±VE[V K _!iV fe (X,ao)|;Vi]l < r s ^ n ^fe- Vfc-i n TOp l|e^' e (t)l|. Using now that 
^ fce{i,..,n} A te[o,T] 

sup || v k Vk-i || - g bounded, we obtain that in probability 

ke{l,-,n} 



1 



(7.12) 



k=l 
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Combining all these results we get (|4.8p . Let us now study 77(0:0) $e,A)i,j defined by (|7.1ip . 

n 

e 2 S^(«o,/3) = E(^'+^) with 

4 J ' = 2A\^-^(a ))^HP,X tk _ 1 ) (i^g^(ao)) and 

Using that E -1 (/3, X ife _ 1 ) converges toward £ 1 (/3o, Xq, Lemma I4.41 (i) yields that 

n 

— > 2Ib(ao, (3o)ij (additionnal terms are negligible since they are bounded by 

k=l £ ' 

nA sup H^aIU)- 

k£{l,..,n},a£K a 

n 

Applying Lemma 14.41 (h) and (|4.6j) yields that ^^B]f — > in Fg -probability. Joining all 

fc=i 

the results we get (|4,7p . In addition, since the limit of e 2 da c a A (ao, (3) is deterministic, we 
have 



d 2 U € a d 2 U e 

Vi G [0, 1] sup ||e 2 — — —(a + t(a e A - a ),/3) - e 2 - -(a ,/3)|| < K\\a e a - Qt ||. (7.13) 

P£K h oaictj daiaj 

Joining (|4.7p and (|7.13p ensures that sup ||r/(a!o,/3)|| — > 0. It remains to prove that 

/3&K b ^ A ^0 

Ib(ao,f3) is invertible for all (3. According to (S2), S(/3,x) is invertible V(/3,x) £ K^x U, 

which ensures that S _1 (/3,x) is a coercive bilinear application. The set K b being compact, 

the coercive constant can be chosen independently of (3. Using (j3.9|) 

rT 

*ds = Cq, with Co strictly positive because 
2fo(ao,A)) is invertible. 

Noting Com(M) the comatrix of M, we have that sup \\ t Com(Ib(cto, < 00 as a 
continuous function of (3 and 

^m||e" 1 (d ei A- a o)|| < ^ sup\\ t Com{I b (aQ,l3))\\sup\\)^A{aQ,P)\\. Hence e _1 (a £ ,A-ao) 
is bounded in ¥g -probability, uniformly w.r.t. (3, which achieves the proof of Proposition l4.2i 



vnf det(I b (a ,l3)) > C± \ || |' 



7. 8. Proof of Proposition \4-3\ 

Using notations (|4.10p and (|4.11|) . we get 

\ (UA,e{ & eA> P) ~ #A,e(ae,A, AO) = A 1 (fi, (3 Q ) + A 2 (a ,(3, (3 ) + {A 2 (a eA , (3, (3 ) - A 2 {a 0l (3, f3 )) . 
We already obtained the convergence result for A\{(3, /3q). Let us study ^42(«o> P, /?o)- Using 

n 

CLBD, A 2 (a ,(3,(3 ) = ^^iV| !0 (M fe _i(/3,^o)), with 

fc=i 

Mf c _i(f3, (3o) = S _1 (/3, X tk _ 1 )S(/?o 5 X tk _ 1 ) —I p - Let us now control the conditional moments 
of JVgoCMfc-iCftft)). Using {LSD yields 
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Hence, ^Vl [JVf (M fc _i(/3, A)))PVi] -> ^ / ^(S" 1 ^, x Q0 (t))£(/? , * ao (t))ett - p, 

fc=i J ^ 

uniformly w.r.t. /3. Using (|7.10p yields 

n a " 

|^^E [(^(Mfc-i^.A)))) 2 !^.,] - -E [^(Mfc-iGS.A))) + 2Tr(M fe 2 _ 1 (/3,/3 ))] | H- 0. 

fc=i fe=i 
The last term is 0(A) and goes to zero. Applying Lemma 17.21 to 

X n , k = ^Nl^Mk^i^Po)) yields 

A 2 (a ,P,(3o) ->■ y / Tr(S(/3o,x ao (t))S (0, x ao (t)) - J p )di in P 0O -probability. Joining 

JO 

these two results we obtain that Ai(/3, /?o) + ^2(00 ; /3, /So) — >• -^2(ao> A) 5 /?) uniformly 

e,A— >o 

w.r.t. /3. 

It remains to prove that A 2 (a tA , P, A)) ~ A 2 (ctQ, A A)) ~ > in probability uniformly w.r.t. 
A Then, we use Lemma I4T21 to set h^{a, Oo) = e _1 ^ jVfc ^°' a ^ + ||a — ao||%)- Then, 

A 2 (a,(3,l3o) - A 2 (a ,/3,/3 ) = Ti(a,a , j3) + T 2 (a, a , /?), with 



Ti(a,a ,/3) = ^y^fc(o, Qq)S 1 (P,X tk _ 1 )h k (a,a ) and 



fc=i 

n 



T 2 (Q,a ,/3) = (a, ao,0) = ^^(a.aojr^^I^JjVtfl.ao). 

By Lemma 14.11 and 14.21 SU P a o)|| < -ftTe _1 ||a — ao|| which leads to 

ke{l,..,n} 

sup \T\(a e a, ciO; < ^A||e _1 (a e a — oo)|| 2 s'wp ||S _1 (/3, X^^JH. Applying Proposition 

14.21 yields that this term goes to zero. 

SU P ll^r *%(o<e,Ai cto)S _1 (/3] A^ _i )|| < fHI 6 " 1 (a e) A — ao)|| is bounded in Pg -probability 
fee{i,..,n} 

by Proposition 14.21 Finally, applying (j4.6j) ensures that ?2(a ei Ai cto 5 /?) - >• and the proof 
is achieved. 



7. 9. Proof of Theorem \4-l\ 

The asymptotic normality of e _1 (a e .A — «o) is obtained just adding the consistency 
result on A, A m the proof of Proposition 14.21 

Taylor expansion of dU gg A at point 9q = (^0) A)) 5 setting 

' e%(«o,/?o) \_(f 1 (M a M a A \ fe-\a eA - a )\ 
^^(a ,A)) " U W«,/J ^ J 1 * j V lvMA,A - A,) J 



Let us first study the asymptotic normality of A 
Setting M|_ 1 = E- 1 (fo,* th _ 1 )Jg(A,,.X tfc _ 1 ) and noting that 

n 

9 '° g(det(E ^' Xtfc - l))) = Tr(M l k _ 1 ), we obtain using definition -^^(a , A)) = XX 

k=l 

with ^| = -^TrtMl^) - ^/E^kfl^k-l) ■ Let us first a PP l Y Lemma O with 
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n 

E E [441^,-! 



fc=i 



TriMt^TriMi^) Tr(MLl) E 



k=l \ 

fe=i V 



^,o(^*-i)l^*-i 



e 2 A 



TriMi ,) r „ 

^ E [^.oMU)!^] 



"£E (?^ E [^0(^-1)^0(^-1)1^-1] - Tr{Mi_ x )Tr{M{_ x ) 2Tr(AP k 



k=l 



-lYTriMUMl,). 



k=l 



Using (|7.9j) and (17. lQf) . the first three summation terms go to zero, while the last one 
goes to AI a (ao, f3o)ij as a Riemann sum. Using that \fnA\ is bounded in probability 



yields that \\A\A{f = 0(4,), leading to E^ [(44) 2 |^-i 

ib=i 

n 

^2, X n,k -> 47 ff (a ,/3o)- In addition, (JESJ yields 



0. Thus, we obtain 



fc=l 



k=l v fc=l 

and sup 

l E [4l-?Vi] I °- Now > settin S 4 

feG{l,..n} 



,A->0 

4 - E [41^,-J leads 



[Jl^A^Ij^^j — >• AI a (a>Q, Pq). Using Taylor's stochastic formula 



fc=i 



4 = ^Tr(M* (s^ir^X^J -I p ] 

+e*XRl e (t k ) - J R, 2 ;;(t fc _ 1 ))M^- 1 (/3o,^ fc _ 1 )( J R^(t fe ) - < £ (i fe -i)). 
Using (I2TTT) and £□} yields that S k oA = S(/3 , x Qo (t fc -i) + O(A). Hence, 

0. Using Corollary O-(ii) yields E 



sup \A k | 2 

fe6{l,..n} 



< 00. 



sup |4| — )• 
fe£{l,..n} e,A->0 

We can now apply Theorem 3.2 p 58 in to the centered martingale A\ to obtain 

the asymptotic normality: -4= («o, /?o) - > A/" (0, 47 CT (o;o, A))) • 

n 

Let us now study the second derivatives of U £i A, ^ Qp.gp, ( a o, Po) = E/^fc^ w ^ n 



Tr{V k t x )-Tr{Mi^Ml_ x ) 



fc=i 

PT^o ( L l J -i " ^-1^-1 " M^MU) , and 



1 



Using (US]) we have | Ve [.B^lJvJ - -Tr(Afjj_ 1 Mj(_ 1 )| — ► 0. Moreover, 



k=l 

n 



||^|| 2 = 0(4», SO £e [(^') 2 |J" tfc _ 1 

n 

E 5 ^ 



fc=i 



fc=l 

2i cr (ao, A))i,j- 



0. Hence Lemma 17.21 yields 
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It remains to study ^§^j(«o,/3o) = witn > 



k=l 



i m^ {ao)M J k _ iE -i {PoiXtk _j Nk{x ^ y 



eA^n 



Posing VH X = ^-^^X^MUi^^iao), we have sup 



ke{l,..,n} 



and J^E [C^l^-i] = -I^X^E [N^ato)]?^]- Thus (HH leads to 



k=l 



k=l 



fc=l 

n 



0. Moreover, setting M^'f^ = nVl'\ v£'\ we apply (|7.9p to obtain 



J> [(Cf ) 2 |J tfe - 1 ] = ^E E Ko«-i)l^-J -> 0. Lemma O leads to 

k=l k=l 

n 



/c=i 



e,A->0 



0. The proof is then achieved. 
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